基础RAG实现 #

本文档中引用的文件

目录 #

简介
项目结构
核心组件
架构概览
详细组件分析
依赖关系分析
性能考虑
故障排除指南
结论

简介 #

LangGraphGo的基础RAG（检索增强生成）实现提供了一个简单而强大的框架，用于构建基于文档检索的问答系统。该实现遵循经典的"检索->生成"模式，通过向量数据库存储文档嵌入，使用语义相似性进行检索，并利用大型语言模型生成准确的答案。

基础RAG是最简单的RAG实现形式，适合快速原型开发、简单的问答系统以及高质量文档集合的应用场景。它提供了清晰的架构设计和易于理解的代码结构，是学习RAG概念的理想起点。

项目结构 #

基础RAG实现的核心文件组织如下：

graph TD
subgraph "示例应用"
A[examples/rag_basic/main.go] --> B[examples/rag_basic/README.md]
end
subgraph "预构建组件"
C[prebuilt/rag.go] --> D[prebuilt/rag_components.go]
C --> E[prebuilt/rag_test.go]
end
subgraph "核心接口"
F[Document] --> G[Embedder]
F --> H[VectorStore]
F --> I[Retriever]
F --> J[LLM]
end
A --> C
C --> F
D --> G
D --> H
D --> I
C --> J

图表来源

[examples/rag_basic/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_basic/main.go#L1-L155)
[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L1-L392)
[prebuilt/rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L1-L333)

章节来源

[examples/rag_basic/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_basic/main.go#L1-L155)
[examples/rag_basic/README.md](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_basic/README.md#L1-L51)

核心组件 #

RAGState数据结构 #

RAGState是贯穿整个RAG管道的状态容器，负责在各个节点之间传递数据。它包含了查询、文档、上下文和答案等关键信息：

classDiagram
class RAGState {
+string Query
+[]Document Documents
+[]Document RetrievedDocuments
+[]DocumentWithScore RankedDocuments
+string Context
+string Answer
+[]string Citations
+map[string]interface Metadata
}
class Document {
+string PageContent
+map[string]interface Metadata
}
class DocumentWithScore {
+Document Document
+float64 Score
}
RAGState --> Document : "contains"
RAGState --> DocumentWithScore : "contains"

图表来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L58-L67)
[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L12-L16)
[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L41-L45)

RAGConfig配置结构 #

RAGConfig提供了灵活的配置选项，允许用户自定义RAG系统的各个方面：

配置项	类型	默认值	描述
TopK	int	4	检索的文档数量
ScoreThreshold	float64	0.7	最小相关性分数阈值
UseReranking	bool	false	是否使用重排序
UseFallback	bool	false	是否使用备用搜索
SystemPrompt	string	默认提示	LLM系统提示
IncludeCitations	bool	true	是否包含引用
MaxTokens	int	1000	最大生成令牌数
Temperature	float64	0.0	生成温度

章节来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L69-L91)
[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L93-L104)

架构概览 #

基础RAG管道采用简单的线性架构，包含两个主要阶段：检索和生成。

flowchart TD
A["查询输入<br/>Query"] --> B["检索节点<br/>Retrieve"]
B --> C["生成节点<br/>Generate"]
C --> D["输出结果<br/>Answer"]
subgraph "检索阶段"
B --> E["向量相似性搜索"]
E --> F["Top-K文档选择"]
end
subgraph "生成阶段"
C --> G["上下文构建"]
G --> H["LLM调用"]
H --> I["答案生成"]
end
style A fill:#e1f5fe
style D fill:#e8f5e8
style B fill:#fff3e0
style C fill:#fce4ec

图表来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L125-L146)
[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L263-L275)
[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L322-L356)

详细组件分析 #

RAGPipeline构建过程 #

BuildBasicRAG方法是构建基础RAG管道的核心，它验证必要的组件并设置节点连接：

sequenceDiagram
participant Client as "客户端"
participant Pipeline as "RAGPipeline"
participant Graph as "MessageGraph"
participant Config as "RAGConfig"
Client->>Pipeline : BuildBasicRAG()
Pipeline->>Config : 检查Retriever
Config-->>Pipeline : 验证成功
Pipeline->>Config : 检查LLM
Config-->>Pipeline : 验证成功
Pipeline->>Graph : AddNode("retrieve", retrieveNode)
Pipeline->>Graph : AddNode("generate", generateNode)
Pipeline->>Graph : SetEntryPoint("retrieve")
Pipeline->>Graph : AddEdge("retrieve", "generate")
Pipeline->>Graph : AddEdge("generate", END)
Graph-->>Pipeline : 构建完成
Pipeline-->>Client : 返回nil

图表来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L125-L146)

检索节点实现 #

检索节点负责从向量存储中查找与查询相关的文档：

flowchart TD
A["接收RAGState"] --> B["提取Query字段"]
B --> C["调用Retriever.GetRelevantDocuments"]
C --> D["执行相似性搜索"]
D --> E["返回Top-K文档"]
E --> F["更新RetrievedDocuments"]
F --> G["更新Documents字段"]
G --> H["返回更新后的RAGState"]
style A fill:#e3f2fd
style H fill:#e8f5e8

图表来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L263-L275)

生成节点实现 #

生成节点将检索到的文档转换为LLM可理解的上下文，并生成最终答案：

flowchart TD
A["接收RAGState"] --> B["构建上下文文本"]
B --> C["遍历Documents"]
C --> D["提取源信息"]
D --> E["格式化文档内容"]
E --> F["拼接上下文部分"]
F --> G["生成完整提示"]
G --> H["构造消息内容"]
H --> I["调用LLM.GenerateContent"]
I --> J["提取回答内容"]
J --> K["更新Answer字段"]
K --> L["返回更新后的RAGState"]
style A fill:#e3f2fd
style L fill:#e8f5e8

图表来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L322-L356)

向量存储和检索机制 #

系统使用内存向量存储来实现高效的相似性搜索：

classDiagram
class InMemoryVectorStore {
+[]Document documents
+[][]float64 embeddings
+Embedder embedder
+AddDocuments(ctx, docs, embeddings) error
+SimilaritySearch(ctx, query, k) []Document
+SimilaritySearchWithScore(ctx, query, k) []DocumentWithScore
}
class VectorStoreRetriever {
+VectorStore VectorStore
+int TopK
+GetRelevantDocuments(ctx, query) []Document
}
class MockEmbedder {
+int Dimension
+EmbedDocuments(ctx, texts) [][]float64
+EmbedQuery(ctx, text) []float64
+generateEmbedding(text) []float64
}
InMemoryVectorStore --> MockEmbedder : "uses"
VectorStoreRetriever --> InMemoryVectorStore : "uses"

图表来源

[prebuilt/rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L94-L333)

章节来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L125-L146)
[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L263-L356)
[prebuilt/rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L94-L333)

依赖关系分析 #

RAG系统具有清晰的分层架构，各组件职责明确：

graph TB
subgraph "应用层"
A[main.go] --> B[RAGPipeline]
end
subgraph "配置层"
B --> C[RAGConfig]
C --> D[DocumentLoader]
C --> E[TextSplitter]
C --> F[Embedder]
C --> G[VectorStore]
C --> H[Retriever]
C --> I[Reranker]
C --> J[LLM]
end
subgraph "组件层"
G --> K[InMemoryVectorStore]
H --> L[VectorStoreRetriever]
F --> M[MockEmbedder]
I --> N[SimpleReranker]
D --> O[StaticDocumentLoader]
E --> P[SimpleTextSplitter]
end
subgraph "接口层"
Q[DocumentLoader] -.-> O
R[TextSplitter] -.-> P
S[Embedder] -.-> M
T[VectorStore] -.-> K
U[Retriever] -.-> L
V[Reranker] -.-> N
end

图表来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L1-L392)
[prebuilt/rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L1-L333)

章节来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L1-L392)
[prebuilt/rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L1-L333)

性能考虑 #

向量相似性计算优化 #

系统使用余弦相似度计算文档间的相似性，这是一种高效且广泛使用的度量方法。对于大规模文档集合，可以考虑以下优化策略：

批量处理: 一次性处理多个查询的嵌入计算
索引优化: 实现更高效的向量索引结构
缓存机制: 缓存常用查询的相似性结果

内存使用优化 #

内存向量存储适合小到中等规模的数据集。对于大规模部署，建议：

使用持久化向量存储（如Chroma、Weaviate）
实现分页加载机制
考虑内存映射文件技术

LLM调用优化 #

设置合理的最大令牌数限制
使用流式响应减少延迟
实现请求队列和限流机制

故障排除指南 #

常见配置错误 #

1. 缺少必需组件 #

问题: retriever is required for basic RAG 原因: 未设置Retriever组件 解决方案: 在配置中提供有效的VectorStoreRetriever实例

代码路径: [prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L127-L129)

2. LLM未正确配置 #

问题: LLM is required for basic RAG 原因: 未设置LLM组件或配置错误 解决方案: 提供有效的llms.Model实例

代码路径: [prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L130-L132)

3. 向量存储为空 #

问题: no documents in vector store 原因: 向量存储中没有添加任何文档 解决方案: 确保在向量存储中添加了至少一个文档

代码路径: [prebuilt/rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L139-L141)

运行时错误处理 #

1. 嵌入生成失败 #

错误类型: Failed to generate embeddings 可能原因: 文本编码问题或嵌入模型配置错误 解决方案: 检查输入文本格式和嵌入模型设置

2. 相似性搜索超时 #

错误类型: SimilaritySearch timeout 可能原因: 大量文档导致搜索时间过长 解决方案: 减少TopK值或优化向量存储结构

3. LLM响应失败 #

错误类型: generation failed 可能原因: API密钥无效、网络问题或模型不可用 解决方案: 验证LLM配置和网络连接

章节来源

[prebuilt/rag.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag.go#L127-L132)
[prebuilt/rag_components.go](https://github.com/smallnest/langgraphgo/blob/main/prebuilt/rag_components.go#L139-L141)
[examples/rag_basic/main.go](https://github.com/smallnest/langgraphgo/blob/main/examples/rag_basic/main.go#L74-L81)

结论 #

LangGraphGo的基础RAG实现提供了一个简洁而功能完整的框架，用于构建基于文档检索的问答系统。其主要优势包括：

设计优势 #

模块化架构: 清晰的组件分离使得系统易于理解和扩展
类型安全: 强类型的RAGState确保数据完整性
灵活配置: 丰富的配置选项满足不同应用场景需求
测试友好: 完善的测试覆盖和模拟组件支持

应用场景 #

知识库问答: 构建企业内部知识管理系统
文档检索: 快速查找相关文档内容
智能客服: 基于产品文档的自动回答
学术研究: 文献检索和摘要生成

扩展建议 #

对于生产环境部署，建议考虑以下扩展：

持久化存储: 使用外部向量数据库替代内存存储
分布式架构: 支持多节点部署和负载均衡
监控告警: 实现性能监控和异常告警机制
版本控制: 对文档和配置进行版本管理

基础RAG实现为开发者提供了一个良好的起点，通过理解其架构和工作原理，可以为进一步的功能扩展和优化奠定坚实基础。